Hyperparameter Optimization of Ensemble Models for Spam Email Detection

نویسندگان

چکیده

Unsolicited emails, popularly referred to as spam, have remained one of the biggest threats cybersecurity globally. More than half emails sent in 2021 were resulting huge financial losses. The tenacity and perpetual presence adversary, spammer, has necessitated need for improved efforts at filtering spam. This study, therefore, developed baseline models random forest extreme gradient boost (XGBoost) ensemble algorithms detection classification spam using Enron1 dataset. then optimized grid-search cross-validation technique search hyperparameter space optimal values. performance (un-tuned) tuned both evaluated compared. impact tuning on was also examined. findings experimental study revealed that when compared with models. RF XGBoost achieved an accuracy 97.78% 98.09%, a sensitivity 98.44% 98.84%, F1 score 97.85% 98.16%, respectively. model outperformed model. is effective efficient email detection.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Model for Email Spam Detection using Hybrid of Magnetic Optimization Algorithm with Harmony Search Algorithm

Unfortunately, among internet services, users are faced with several unwanted messages that are not even related to their interests and scope, and they contain advertising or even malicious content. Spam email contains a huge collection of infected and malicious advertising emails that harms data destroying and stealing personal information for malicious purposes. In most cases, spam emails con...

متن کامل

Bayesian Hyperparameter Optimization for Ensemble Learning

In this paper, we bridge the gap between hyperparameter optimization and ensemble learning by performing Bayesian optimization of an ensemble with regards to its hyperparameters. Our method consists in building a fixed-size ensemble, optimizing the configuration of one classifier of the ensemble at each iteration of the hyperparameter optimization algorithm, taking into consideration the intera...

متن کامل

Symbiotic filtering for spam email detection

This paper presents a novel spam filtering technique called Symbiotic Filtering (SF) that aggregates distinct local filters from several users to improve the overall performance of spam detection. SF is an hybrid approach combining some features from both Collaborative (CF) and Content-Based Filtering (CBF). It allows for the use of social networks to personalize and tailor the set of filters t...

متن کامل

Neural Network Model for Email-Spam Detection

Email spam is a word that we come across in our daily life. The word spam means junk mails. The unsolicited emails that are received by any person in his/her mailbox are called spam. These junk mails are usually sent in bulk for advertising and marketing some products. This work presents a neural network approach to intrusion detection. A Multi-Layer Perceptron using Back Propagation Algorithm ...

متن کامل

Email Spam Detection Using Customized SimHash Function

E-mail communication is a narrative challenging in present days, because a problem can be done in that communication from one to other emails process generation. The problem is spam mail combination in original mail interaction. This is the major task for sending information from one to other persons, if it important to that particular person. So to solve these problems effectively traditionall...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13031971